with ReinforcemenT) Learning Agents: A Preliminary Report

Authors

  • Christopher Child
  • Kostas Stathis
Abstract

We present a framework for building agents that learn using SMART, a system that combines stochastic model acquisition with reinforcement learning to enable an agent to model its environment through experience and subsequently form action selection policies using the acquired model. We extend an existing algorithm for automatic creation of stochastic STRIPS operators (Oates et al. 1995) as a preliminary method of environment modelling. We then define the process of generating future states from these operators and an initial state, and finally show how the agent can use the generated states to form a policy with a standard reinforcement learning algorithm. The potential of SMART is exemplified using the well-known predator-prey scenario. Results of applying SMART to this environment and directions for future work are discussed.
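The pipeline described in the abstract (stochastic operators, generation of future states, policy formation) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `Operator` and `Outcome` classes, the hand-written operators (SMART would learn them from experience), the three-proposition toy version of a predator approaching its prey, the reward function, and the choice of value iteration as a stand-in for the unspecified "standard reinforcement learning algorithm".

```python
from dataclasses import dataclass

# A state is a frozenset of propositions that currently hold.

@dataclass(frozen=True)
class Outcome:
    prob: float        # probability of this outcome
    add: frozenset     # propositions made true
    delete: frozenset  # propositions made false

@dataclass(frozen=True)
class Operator:
    action: str
    context: frozenset  # propositions that must hold for the operator to apply
    outcomes: tuple     # tuple of Outcome; probabilities sum to 1

def successors(state, action, operators):
    """Return {next_state: probability} for taking `action` in `state`."""
    for op in operators:
        if op.action == action and op.context <= state:
            dist = {}
            for out in op.outcomes:
                nxt = (state - out.delete) | out.add
                dist[nxt] = dist.get(nxt, 0.0) + out.prob
            return dist  # simplification: the first matching operator is used
    return {state: 1.0}  # assumption: if no operator applies, nothing changes

def reachable(start, actions, operators):
    """Generate all states reachable from `start` under the operators."""
    seen, frontier = {start}, [start]
    while frontier:
        s = frontier.pop()
        for a in actions:
            for nxt in successors(s, a, operators):
                if nxt not in seen:
                    seen.add(nxt)
                    frontier.append(nxt)
    return seen

def value_iteration(states, actions, operators, reward, gamma=0.9, sweeps=100):
    """Form a greedy policy over the generated states (stand-in RL step)."""
    values = {s: 0.0 for s in states}

    def q(s, a):
        return sum(p * (reward(n) + gamma * values[n])
                   for n, p in successors(s, a, operators).items())

    for _ in range(sweeps):
        for s in states:
            values[s] = max(q(s, a) for a in actions)
    policy = {s: max(actions, key=lambda a: q(s, a)) for s in states}
    return values, policy

# Hypothetical toy domain: the predator is two steps, one step, or zero
# steps from the prey, and "advance" succeeds with probability 0.8.
far, near, caught = (frozenset({"dist2"}), frozenset({"dist1"}),
                     frozenset({"dist0"}))
stay = Outcome(0.2, frozenset(), frozenset())
operators = [
    Operator("advance", far, (Outcome(0.8, near, far), stay)),
    Operator("advance", near, (Outcome(0.8, caught, near), stay)),
]
actions = ("advance", "wait")
states = reachable(far, actions, operators)
values, policy = value_iteration(
    states, actions, operators,
    reward=lambda s: 1.0 if "dist0" in s else 0.0)
print(policy[far], policy[near])
```

In this sketch the model is queried only through `successors`, so learned operators could replace the hand-written ones without changing the state generation or policy steps.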




Publication date: 2003